1. Review of three types of codes
ANSI strings are the ones we are most familiar with: an English letter occupies one byte, a Chinese character occupies 2 bytes, and the string ends with a \0. They are commonly used in TXT text files. Unicode strings: each character (Chinese character or English letter)
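The byte counts described above can be checked with a short sketch. It assumes that "ANSI" here means the GBK code page used on Simplified Chinese Windows systems; the excerpt does not say which code page is meant, and the helper name `encoded_sizes` is made up for illustration.

```python
def encoded_sizes(text, encoding="gbk"):
    """Return (character, byte count) pairs for text in the given encoding.

    Assumption: "ANSI" is taken to mean GBK; other ANSI code pages differ.
    """
    return [(ch, len(ch.encode(encoding))) for ch in text]


sizes = encoded_sizes("A中")
print(sizes)  # one byte for the English letter, two for the Chinese character

# A C-style ANSI string is the encoded bytes followed by a single \0 terminator.
c_style = "A中".encode("gbk") + b"\x00"
print(len(c_style))
```

Under that assumption the letter takes one byte and the Chinese character two, so the terminated string is four bytes long.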
C++ does not support Unicode, not even UTF-8. By now Unicode is common knowledge, but it is still a headache for some programming languages with a long history. Without the support of third-party libraries, C++ does not actually effectively
What is the relationship between UTF-8 and Unicode encoding, and what is the difference? UTF-8 = Unicode Transformation Format, 8-bit.
It is a Unicode transformation format: it converts Unicode text into a byte stream for transfer.
UTF-8 stream conversion program:
Input: unsigned integer c, the code point of the character to be encoded
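The excerpt stops before the conversion program itself, so the following is only a minimal sketch of such a conversion, not the original author's code. It takes a code point `c` as input and follows the current UTF-8 definition (at most four bytes, surrogates rejected); the function name `utf8_encode` is an assumption, and the original program may have used the older, longer forms.

```python
def utf8_encode(c: int) -> bytes:
    """Encode one Unicode code point c as UTF-8 bytes (illustrative sketch)."""
    if c < 0 or c > 0x10FFFF or 0xD800 <= c <= 0xDFFF:
        raise ValueError(f"not a valid Unicode scalar value: {c:#x}")
    if c < 0x80:
        # 0xxxxxxx: plain ASCII, one byte
        return bytes([c])
    if c < 0x800:
        # 110xxxxx 10xxxxxx
        return bytes([0xC0 | (c >> 6), 0x80 | (c & 0x3F)])
    if c < 0x10000:
        # 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([
            0xE0 | (c >> 12),
            0x80 | ((c >> 6) & 0x3F),
            0x80 | (c & 0x3F),
        ])
    # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([
        0xF0 | (c >> 18),
        0x80 | ((c >> 12) & 0x3F),
        0x80 | ((c >> 6) & 0x3F),
        0x80 | (c & 0x3F),
    ])


# Cross-check against Python's built-in encoder.
for cp in (0x41, 0xE9, 0x4E2D, 0x1F600):
    assert utf8_encode(cp) == chr(cp).encode("utf-8")

print(utf8_encode(0x4E2D).hex())  # e4b8ad
```

The leading byte carries the length of the sequence in its high bits, and every continuation byte starts with the bits 10, which is what lets a decoder resynchronize in the middle of a stream.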
A brief introduction to Unicode and UTF-8 encoding in PHP. Getting to know Unicode and UTF-8 encoding again: until today, I only knew that UTF-8 encoding and Unicode
Copyright disclaimer: when reprinting, please use a hyperlink to indicate the original source and author information of the article, together with this statement: Http://dark0729.blogbus.com/logs/51496111.html
Character set (charset): defines which characters a set contains, that is, which characters belong to the set and which do not. Examples are ASCII, GBK, and Unicode. Almost all other character sets contain the ASCII
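One way to see the overlap with ASCII is to encode the same text under several encodings and compare the bytes. This is a sketch only: the helper name `same_bytes_everywhere` and the choice of encodings are illustrative assumptions, not something the excerpt specifies.

```python
def same_bytes_everywhere(text, encodings=("ascii", "gbk", "utf-8")):
    """Return True if text encodes to identical bytes under every encoding."""
    encoded = {enc: text.encode(enc) for enc in encodings}
    return len(set(encoded.values())) == 1


# ASCII-only text is byte-for-byte identical in ASCII, GBK, and UTF-8.
print(same_bytes_everywhere("Hello"))

# A Chinese character is outside ASCII, and GBK and UTF-8 encode it differently.
print(same_bytes_everywhere("中", encodings=("gbk", "utf-8")))
```

The first call reports identical bytes and the second does not, which shows that two encodings can agree on the ASCII range while differing everywhere else.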